Model Selection

Long text processing

# Long text processing

MiniCPM4 is an efficient large - language model designed specifically for edge devices. Through systematic innovation, it achieves extreme efficiency improvements in four key dimensions: model architecture, training data, training algorithm, and inference system.

Large Language Model

Transformers Supports Multiple Languages

MiniCPM4 is an efficient large language model designed specifically for edge devices. Through systematic innovation, it achieves extreme efficiency improvements in four dimensions: model architecture, training data, training algorithm, and inference system. It can achieve over 5 times faster generation speed on edge chips.

Large Language Model

Transformers Supports Multiple Languages

Qwen3-4B is the latest version in the Qwen series of large language models with 4B parameters, supporting switching between reasoning and non-reasoning modes, excelling at inference, instruction following, and multilingual processing.

Large Language Model

Qwen3-8B-AWQ is the latest generation of large language model with 8.2B parameters in the Tongyi Qianwen series, which uses AWQ 4-bit quantization technology to optimize inference efficiency. It supports the switching between thinking and non-thinking modes and has excellent reasoning, instruction-following, and intelligent agent capabilities.

Large Language Model

Qwen3 8B GPTQ Int4

Qwen3-4B is the latest large language model in the Qwen series, featuring the ability to switch thinking modes, powerful reasoning capabilities, excellent human preference alignment, outstanding agent capabilities, and multilingual support.

Large Language Model

Gemma 3 R1984 27B Q6 K GGUF

GGUF format model converted from VIDraft/Gemma-3-R1984-27B, supporting multilingual text generation

Large Language Model Supports Multiple Languages

Reranker ModernBERT Large Gooaq Bce

This is a cross-encoder model fine-tuned from ModernBERT-large, used to calculate the scores of text pairs, suitable for text re-ranking and semantic search tasks.

Text Embedding English

Croguana RC2 Gguf

Croatian text generation model based on Mistral architecture, trained with Unsloth acceleration

Large Language Model Other

Qwen2.5 QwQ 35B Eureka Cubed

Enhanced version of QwQ-32B, suitable for all usage scenarios, with outstanding reasoning and output capabilities.

Large Language Model

Transformers Other

Thor V2.5 8b FANTASY FICTION 128K Q4 K M GGUF

This is a GGUF-format converted 8B-parameter language model specialized for fantasy fiction, supporting 128K context length.

Large Language Model English

Frigg V2 8b ACADEMIC 128K Q4 K M GGUF

Frigg-v2-8b-ACADEMIC-128K-Q4_K_M-GGUF is an 8B-parameter large language model in GGUF format, suitable for various text generation tasks.

Large Language Model English

Longwriter V 72B

A multimodal large model fine-tuned on the LongWriter-V-22K dataset based on Qwen2.5-VL-72B-Instruct

L3.3 Cu Mai R1 70b

A 70B-parameter large language model based on the Llama3 architecture, specially optimized

Large Language Model

Italian ModernBERT Base

Italian ModernBERT is a specialized version of ModernBERT for Italian language, pre-trained specifically on Italian text.

Large Language Model

Transformers Other

Modernbert Base Ita

ModernBERT is a modern bidirectional encoder-only Transformer model (BERT-style), pre-trained on 2 trillion tokens of English and code data, with a native context length of up to 8,192 tokens.

Large Language Model

Transformers Supports Multiple Languages

KURE-v1 is an embedding model specifically optimized for Korean text retrieval, fine-tuned based on BAAI/bge-m3, and excels in Korean retrieval tasks.

A sentence embedding model fine-tuned on the Korean triplet dataset based on the Alibaba-NLP/gte-multilingual-base model for semantic similarity calculation

Text Embedding Supports Multiple Languages

This is a sentence-transformers model fine-tuned on a Korean triplet dataset based on Alibaba NLP/gte-multilingual-base, designed for semantic textual similarity tasks.

Text Embedding Supports Multiple Languages

Granite 3.0 3b A800m Instruct

A 3-billion parameter instruction-tuned language model developed by IBM, based on Granite-3.0 architecture, supporting multilingual tasks and commercial applications

Large Language Model

This model is a large language model fine-tuned based on Qwen-2 72B Instruct, aiming to replicate the prose quality of the Claude 3 series of models and is the seventh version in the series of models.

Large Language Model

Safetensors Supports Multiple Languages

Gte Base Korean

A Korean sentence embedding model fine - tuned on Alibaba - NLP/gte - multilingual - base, supporting tasks such as semantic text similarity calculation and semantic search.

Jais Family 1p3b Chat

Jais series 1.3 billion parameter Arabic-English bilingual large language model, optimized for exceptional Arabic capabilities while maintaining strong English proficiency

Large Language Model Supports Multiple Languages

Transformer encoder pretrained based on Megatron-LM, specifically designed for Japanese scenarios

Large Language Model

Transformers Supports Multiple Languages

Llama 2 7b Ukrainian

Llama-2-7b-Ukrainian Version is a bilingual pre-trained model supporting Ukrainian and English, based on continued pre-training of Llama-2-7b using 5 billion tokens of data from CulturaX.

Large Language Model

Transformers Supports Multiple Languages

Turkish Llama 8b V0.1 GGUF

Turkish-Llama-8b-v0.1 is a fully fine-tuned Turkish text generation model based on LLaMA-3 8B, trained on a 30GB Turkish dataset.

Large Language Model Other

Yi 1.5 34B Chat 16K

Yi-1.5 is an upgraded version of the Yi model, demonstrating superior performance in programming, mathematics, reasoning, and instruction-following capabilities.

Large Language Model

Yi-1.5 is an upgraded version of the Yi model, excelling in programming, mathematics, reasoning, and instruction-following capabilities while maintaining outstanding language understanding, commonsense reasoning, and reading comprehension.

Large Language Model

Mlong T5 Tglobal Base Et Riigikogu Summary

This is an Estonian text summarization model based on the T5 architecture, specifically designed for summarizing stenographic records of the Estonian Parliament discussions.

Text Generation

Transformers Other

360zhinao 7B Base

360 Zhinao is an open-source large language model series developed by Qihoo 360, including base models and dialogue models with various context lengths, supporting both Chinese and English.

Large Language Model

Transformers Supports Multiple Languages

Mosaicml Mpt 7b Storywriter Bnb 4bit Smashed

PrunaAI's compressed MPT-7B story-writing model, enabling efficient inference through llm-int8 technology

Large Language Model

Transformers Other

Bge M3 Zeroshot V2.0

A model specifically designed for efficient zero-shot classification, supporting multilingual text classification tasks without requiring training data

Text Classification

Transformers Other

Bge M3 Zeroshot V2.0 C

A multilingual zero-shot text classification model trained based on BAAI/bge-m3-retromae, specifically designed for commercial-friendly scenarios

Text Classification

Transformers Other

Rubert Mini Sts

This is a base BERT model for computing compact embedding vectors of Russian sentences, developed based on cointegrated/rubert-tiny2, with the number of layers increased from 3 to 7.

Transformers Other

Qra is a series of Polish-optimized large language models jointly developed by the Polish National Information Processing Institute and Gdańsk University of Technology, initialized based on TinyLlama-1.1B and trained on 90 billion Polish tokens

Large Language Model

Ruropebert Classic Base 512

A Russian encoder model based on the RoPEBert architecture, trained using cloning methods, supports 512-token context, and surpasses the original ruBert-base model in quality

Large Language Model

Transformers Other

polka-1.1b is a bilingual (Polish and English) text generation model enhanced by continuing pre-training on 5.7 billion Polish tokens based on the TinyLlama-1.1B model.

Large Language Model

Transformers Supports Multiple Languages

An experimental fine-tuned model based on yi-34b-200k, suitable for creative writing, role-playing, and other tasks, without DPO stage applied.

Large Language Model

Law LLM 13B GGUF

Law LLM 13B is a specific domain foundation model developed based on LLaMA-1-13B, focusing on tasks in the legal domain.

Large Language Model

Transformers English

Bce Reranker Base V1

A bilingual and cross-language reranking model optimized for RAG, supporting Chinese, English, Japanese, and Korean, providing explainable absolute scores

Transformers Supports Multiple Languages

Titulm Mpt 1b V1.0

TituLM-1B-BN-V1 is a large language model specifically trained for generating and understanding Bengali text, extensively trained on a dataset containing 4.51 billion Bengali tokens.

Large Language Model

Transformers Other

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase